CDS

Accession Number TCMCG075C19793
gbkey CDS
Protein Id XP_017979352.1
Location join(21040387..21040587,21040710..21040883,21041036..21041135,21041449..21041925,21042713..21043435,21044058..21044149,21044599..21044672,21044845..21045067,21045258..21045351,21045501..21045655,21045964..21046188,21046739..21046939,21047397..21047522,21047600..21047661,21048035..21048152,21048436..21048720,21048808..21048846)
Gene LOC18596361
GeneID 18596361
Organism Theobroma cacao

Protein

Length 1122aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018123863.1
Definition PREDICTED: peroxisome biogenesis protein 1 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category O
Description Peroxisome biogenesis protein
KEGG_TC 3.A.20.1
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K13338        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04146        [VIEW IN KEGG]
map04146        [VIEW IN KEGG]
GOs GO:0006082        [VIEW IN EMBL-EBI]
GO:0006605        [VIEW IN EMBL-EBI]
GO:0006625        [VIEW IN EMBL-EBI]
GO:0006629        [VIEW IN EMBL-EBI]
GO:0006631        [VIEW IN EMBL-EBI]
GO:0006635        [VIEW IN EMBL-EBI]
GO:0006810        [VIEW IN EMBL-EBI]
GO:0006886        [VIEW IN EMBL-EBI]
GO:0006996        [VIEW IN EMBL-EBI]
GO:0007031        [VIEW IN EMBL-EBI]
GO:0008104        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0009056        [VIEW IN EMBL-EBI]
GO:0009062        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0015031        [VIEW IN EMBL-EBI]
GO:0015833        [VIEW IN EMBL-EBI]
GO:0015919        [VIEW IN EMBL-EBI]
GO:0016042        [VIEW IN EMBL-EBI]
GO:0016043        [VIEW IN EMBL-EBI]
GO:0016054        [VIEW IN EMBL-EBI]
GO:0016558        [VIEW IN EMBL-EBI]
GO:0017038        [VIEW IN EMBL-EBI]
GO:0019395        [VIEW IN EMBL-EBI]
GO:0019752        [VIEW IN EMBL-EBI]
GO:0030258        [VIEW IN EMBL-EBI]
GO:0032787        [VIEW IN EMBL-EBI]
GO:0033036        [VIEW IN EMBL-EBI]
GO:0033365        [VIEW IN EMBL-EBI]
GO:0034440        [VIEW IN EMBL-EBI]
GO:0034613        [VIEW IN EMBL-EBI]
GO:0042886        [VIEW IN EMBL-EBI]
GO:0043436        [VIEW IN EMBL-EBI]
GO:0043574        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044242        [VIEW IN EMBL-EBI]
GO:0044248        [VIEW IN EMBL-EBI]
GO:0044255        [VIEW IN EMBL-EBI]
GO:0044281        [VIEW IN EMBL-EBI]
GO:0044282        [VIEW IN EMBL-EBI]
GO:0044743        [VIEW IN EMBL-EBI]
GO:0045184        [VIEW IN EMBL-EBI]
GO:0046395        [VIEW IN EMBL-EBI]
GO:0046907        [VIEW IN EMBL-EBI]
GO:0051179        [VIEW IN EMBL-EBI]
GO:0051234        [VIEW IN EMBL-EBI]
GO:0051641        [VIEW IN EMBL-EBI]
GO:0051649        [VIEW IN EMBL-EBI]
GO:0055085        [VIEW IN EMBL-EBI]
GO:0055114        [VIEW IN EMBL-EBI]
GO:0065002        [VIEW IN EMBL-EBI]
GO:0070727        [VIEW IN EMBL-EBI]
GO:0071702        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0071705        [VIEW IN EMBL-EBI]
GO:0071806        [VIEW IN EMBL-EBI]
GO:0071840        [VIEW IN EMBL-EBI]
GO:0072329        [VIEW IN EMBL-EBI]
GO:0072594        [VIEW IN EMBL-EBI]
GO:0072662        [VIEW IN EMBL-EBI]
GO:0072663        [VIEW IN EMBL-EBI]
GO:1901575        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGAGTTTGAGGTGAGACACGTGGCAGGAATAGAGGACTGCTTCGTATCTCTTCCACTCCTACTCATCCAAACCCTTCAATCCACGCGCTCTTCTCTCCTCCCTCCCCTTCTCGCTCTCGAGCTTCGCCTCCCACGCTCCTCCGACCACCCCTGGATCGTCGCTTGGTCCGGCGCTGCTTCTTCTTCCACTGCTATTGAGGTTTCTCAACAATTTGCAGAATGTATATCTTTGCCCAATCACACCACAGTTCAAGTACGAGCAGCTTCTAATATGGCAAAGGCTACATTAGTCACAATTGAACCTCATACCGAGGATGATTGGGAAGTTTTAGAGCTTAACTCTGAGCACGCAGAAGCTGCTATATTAAAGCAGGTCAGGATTGTCCATGAAGGAATGCGATTTCCTCTGTGGTTGCATGGCCGCACGATCGTAACTTTCCTAGTGGTTTCAACCTTTCCCAAGAAAGCGGTGGTTCAACTTGTCCCTGGAACAGAAGTTGCTGTTGCTCCAAAGAGACGTGAGAAAAATTTAAACAACATGGAATCGTCTACCAGAGAATCTCATGGTGCAAAAGCACTGCTACGTTTGCAAGATTCGGACAGAAGATTGTTTCACAAAAGCAATGTCAAAGGTGTTGAGCTTGGGGTAGCACTCACTTCTGTCGCCTTTATTCATCAAGTAACAGCTAAAAGATTTTCATTGGAGTCTCTTCAGTTGGTTGTTATAGTGCCAAGATTGTCATCCAAAGGGAGTGTGAAGAATCTGGAAAATGATGCCTTGAGAATGAAAGGAAGTTTAACTTCCAAGGAAGTAAATAGTGGAATTTCAACTGATAATAAGGAATTTCGTCAAGTGATTGTTCACCTTTTAATTTCAGATTCAGTGGCTGAAGGACATGTAATGATTACTCGCTCTCTTCGGCTTTATTTGAGAGCAGGACTACATTCATGGGTTTATTTAAAGGGCTATAATGTTGCTTTGAAGAAGGAAATTTCTGTACTGTCACTTTCTCCATGCCACTTCAAGATGGTTGCAAATGATAAGGAGAATGGTCTTGAAGTGCTTGATGGCCATAAAACTCGTAGGATGAAAAACTCTGGTTCAGGAACCTCTTTAGAGGTAGTAAATTGGTCAACCCATGATGATGTTGTGGCTGTTCTTTCTTCTGAATTTCCTTTCCAAGAGGCTGAAGACTCCAGTCAGGAAGACACTAAAAAGGGCTTAGAATGTCTTCTTCGTGCATGGTTTCTTGCTCAACTTGATGCTATAGCTTCAAATGCAGGGACGGAAGTTAAGACATTGGTTTTGGGGAATGAAAATCTACTTCACTTTGAGGTGAACAGATATGATTCTGGGACTTACGGACTAGTCTCATCTAATGGTTTTTCAGAAAAGAGAAATAAGACTAAGGACTTGCCGGTGGAAATTTCATACATATTGACCATTTCTGAGGAACTACTGCACAGTGGAAATGTTAATGCGTATGAGCTTGCCCTTGATGATAGAAACAAGAGGAATGATGTTCAGGGCGGTTTCGAGTTGTTTGGAAAGCTAAATTTGGGTAACCCTATGTCCCTATATTCTGTTAAAGACAGAACATCTGTCAAGGGGTTTAGCACAAATGCATCTTCATTAAGCTGGATGGGTGTGACTGCTTCTGATGTTATCAATAGAATGATGGTGTTGTTAGCTCCTGCTTCTGGAATTTGGTTTAGTACTTACAATCTTCCTCTCCCAGGACATGTTCTAATATATGGACCTGCGGGTTCTGGAAAGACATTATTGGCTAGAGCTGTTGCAAAGTCCCTTGAAGAACATAAAGACCTGTTAGCACATGTAATCTTTATATGTTGCTCAGGGCTTGCTTTAGAGAAGCCCCCAACCATTCGTCAAGCGCTTTCAAGTTTTGTGTCTGAAGCTCTAGATCATGCACCTTCAGTTGTTGTTTTTGATGATCTTGATAGTATCATCCAATCTTCATCTGACTCAGAAGGATCCCAACCTTCAACCTCAGTTGTTGCACTTACTAAATTTCTCACTGACATTATTGATGAATATGGAGAAAAGAGGAAGAGCTCCTGTGGTATTGGTCCAATAGCTTTTATAGCTTCTGTGCAGTCTCTGGAGAGTATCCCTCAGTCTTTGAGCTCATCAGGAAGGTTTGACTTTCATGTGCAACTACCTGCACCTGCTGCCTCTGAACGTGGGGCCATATTGAAGCATGAAATTCAGAGGCGTTCCCTACAATGTCATGATGACATCTTACTTGATGTAGCTTCCAAATGTGATGGATATGATGCATATGATCTGGAAATATTGGTTGATAGAGCTGTTCATGCCGCCATTGGTCGGTTTTTGCCTTCTGATTCTGAAGAATACGTGAAGCCCATTTTAGTTAGGGAGGATTTCTCTCATGCTATGCATGAGTTCCTTCCAGTTGCCATGCGTGACATTACTAAATCTGCTCCTGAAGTTGGTCGCTCTGGTTGGGATGATGTTGGTGGTCTCAATGACATTCGAGATGCTATCAAAGAGATGATTGAAATGCCTTCAAAGTTTCCGAATATATTTGCACAAGCTCCTTTAAGGTTGCGGTCTAATGTTCTCTTATATGGTCCTCCTGGCTGTGGTAAAACCCACATTGTTGGTGCTGCTGCTGCCGCTTGTTCACTAAGATTTATATCGGTGAAAGGGCCTGAGCTACTGAACAAATACATTGGTGCTTCTGAGCAAGCTGTTCGAGATATTTTTTCAAAGGCAGCTGCTGCAGCGCCATGCCTCCTCTTTTTTGATGAATTTGATTCCATTGCACCTAAAAGAGGGCATGACAACACTGGAGTAACTGATAGAGTTGTTAATCAATTCCTAACAGAATTAGATGGCGTTGAAGTTTTGACTGGTGTATTTGTGTTTGCTGCAACAAGTAGACCAGATCTGCTTGATGCTGCATTGCTGAGACCAGGTAGGCTCGATCGCCTCCTTTTCTGTGATTTTCCATCTCGGCGTGAGAGGTTGGATGTTCTGACTGTTCTTTCTAGAAAGCTACCATTAGCCAGTGATGTTGATTTAGGCGCCATAGCTTGTATGACAGAAGGATTTAGCGGAGCTGATCTCCAAGCTCTTCTCTCAGACGCACAGCTTGCTGCAGTTCATGAACATTTGAGCAGTGTGAGTAGCAATGAGCCTGGAAAAATGCCAGTCATAACTGATGGTGTTTTGAAGTCTATTGCTTCAAAGGCAAGACCATCAGTTTCAGAAACCGAGAAGCAGAGACTTTATGGCATCTACAGTCAGTTTCTGGATTCAAAGAGATCCGTTGCTGCACAGTCAAGGGATGCAAAAGGCAAGAGGGCAACTCTGGCATGA
Protein:  
MEFEVRHVAGIEDCFVSLPLLLIQTLQSTRSSLLPPLLALELRLPRSSDHPWIVAWSGAASSSTAIEVSQQFAECISLPNHTTVQVRAASNMAKATLVTIEPHTEDDWEVLELNSEHAEAAILKQVRIVHEGMRFPLWLHGRTIVTFLVVSTFPKKAVVQLVPGTEVAVAPKRREKNLNNMESSTRESHGAKALLRLQDSDRRLFHKSNVKGVELGVALTSVAFIHQVTAKRFSLESLQLVVIVPRLSSKGSVKNLENDALRMKGSLTSKEVNSGISTDNKEFRQVIVHLLISDSVAEGHVMITRSLRLYLRAGLHSWVYLKGYNVALKKEISVLSLSPCHFKMVANDKENGLEVLDGHKTRRMKNSGSGTSLEVVNWSTHDDVVAVLSSEFPFQEAEDSSQEDTKKGLECLLRAWFLAQLDAIASNAGTEVKTLVLGNENLLHFEVNRYDSGTYGLVSSNGFSEKRNKTKDLPVEISYILTISEELLHSGNVNAYELALDDRNKRNDVQGGFELFGKLNLGNPMSLYSVKDRTSVKGFSTNASSLSWMGVTASDVINRMMVLLAPASGIWFSTYNLPLPGHVLIYGPAGSGKTLLARAVAKSLEEHKDLLAHVIFICCSGLALEKPPTIRQALSSFVSEALDHAPSVVVFDDLDSIIQSSSDSEGSQPSTSVVALTKFLTDIIDEYGEKRKSSCGIGPIAFIASVQSLESIPQSLSSSGRFDFHVQLPAPAASERGAILKHEIQRRSLQCHDDILLDVASKCDGYDAYDLEILVDRAVHAAIGRFLPSDSEEYVKPILVREDFSHAMHEFLPVAMRDITKSAPEVGRSGWDDVGGLNDIRDAIKEMIEMPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAAAAAPCLLFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSRRERLDVLTVLSRKLPLASDVDLGAIACMTEGFSGADLQALLSDAQLAAVHEHLSSVSSNEPGKMPVITDGVLKSIASKARPSVSETEKQRLYGIYSQFLDSKRSVAAQSRDAKGKRATLA